Degraded Character Recognition
نویسنده
چکیده
The DCR application for Degraded Character Recognition was developed for the DEA (Diplôme d’Études Approfondies)’s thesis. The main objective is to recognize characters, degraded by an acquisition from a lowresolution camera. Several steps are needed and are detailed in this thesis from the thresholding step, which converts a gray-level picture in a black and white one, to the post-correction step, which adds linguistic information to improve recognition results. Several sophisticated methods are used, such as wavelet decomposition or neural networks for different purposes. A different view is given here to understand how steps before the recognition part can induce changes on results.
منابع مشابه
Multiple classifier for degraded machine printed character recognition Multiple classifier for degraded machine printed character recognition
The general problem of optical character recognition (OCR) remains a fundamental but not entirely solved issue in document analysis. In spite of significant improvements in the area of optical character recognition, the recognition of degraded printed characters, in particular, is still lacking satisfactory solutions. This paper presents an OCR method that combines the Hopfield network with a s...
متن کاملGrayscale Feature Combination in Recognition based Segmentation for Degraded Text String Recognition
Grayscale feature is very effective for degraded character recognition. While many papers focus on different feature extraction algorithms on single character recognition, few deals with the impact of the selected feature on segmentation. For recognition-based segmentation, a good recognition performance on single character may not always have good performance on segmentation. In this paper, tw...
متن کاملA Study of Touching Characters in Degraded Gurmukhi Text
Character segmentation is an important preprocessing step for text recognition. In degraded documents, existence of touching characters decreases recognition rate drastically, for any optical character recognition (OCR) system. In this paper a study of touching Gurmukhi characters is carried out and these characters have been divided into various categories after a careful analysis. Structural ...
متن کاملRecognizing Degraded Handwritten Characters
In this report, a character recognition system is proposed that handles degraded manuscript documents which were discovered at the St. Catherine’s Monastery. In contrast to state-of-the-art Ocr systems, no early decision, namely the image binarization, needs to be performed. Thus, an object recognition methodology is adapted for the recognition of ancient manuscripts. Therefore, interest points...
متن کاملImproved Degraded Document Recognition with Hybrid Modeling Techniques and Character N-Grams
In this paper a robust multifont character recognition system for degraded documents such as photocopy or fax is described. The system is based on Hidden Markov Models (HMMs) using discrete and hybrid modeling techniques, where the latter makes use of an information theory-based neural network. The presented recognition results refer to the SEDAL-database of English documents using no dictionar...
متن کاملDegraded Document Analysis and Extraction of Original Text Document: An Approach without Optical Character Recognition
Document Image Analysis recognizes text and graphics in documents acquired as images. An approach without Optical Character Recognition (OCR) for degraded document image analysis has been adopted in this paper. The technique involves document imaging methods such as Image Fusing and Speeded Up Robust Features (SURF) Detection to identify and extract the degraded regions from a set of document i...
متن کامل